Temporal Evolution of Financial Market Correlations 
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We investigate financial market correlations using random matrix theory and principal component 
analysis. We use random matrix theory to demonstrate that correlation matrices of asset price 
changes contain structure that is incompatible with uncorrelated random price changes. We then 
identify the principal components of these correlation matrices and demonstrate that a small number 
of components accounts for a large proportion of the variability of the markets that we consider. 
We characterize the time-evolving relationships between the different assets by investigating the 
correlations between the asset price time series and principal components. Using this approach, 
we uncover notable changes that occurred in financial markets and identify the assets that were 
significantly affected by these changes. We show in particular that there was an increase in the 
strength of the relationships between several different markets following the 2007-2008 credit and 
liquidity crisis. 



I. INTRODUCTION 

The global financial system is composed of a variety 
of markets, which are spread across multiple geographic 
locations and in which a broad range of financial prod- 
ucts are traded. Despite the diversity of markets and the 
disparate nature of the products that are traded, price 
changes of assets often respond to the same economic an- 
nouncements and market news [H-Q • The fact that asset 
prices depend on the same signals implies that there is 
strong coupling between prices, so price time series can 
exhibit similar characteristics and be correlated. One of 
the primary concerns of market practitioners is to esti- 
mate the strength of such correlations Q. 

There are many reasons for wanting to understand cor- 
relations in price movements. Perhaps the most famil- 
iar motivation is for risk management purposes, because 
large changes in the value of a portfolio are more likely 
if the prices of the assets held in the portfolio are corre- 
lated p|. An understanding of the correlation between 
financial products is therefore crucial for managing in- 
vestment risk. It has also been shown that the strength of 
the correlations between some markets can be explained 
by macroeconomic factors [fl Q. An understanding of 
correlations can therefore illuminate the macroeconomic 
forces driving markets and help inform asset allocation 
decisions [f|. 

In this paper, we use principal component analysis 
(PCA) to produce a parsimonious representation of mar- 
ket correlations and characterize the evolving correlation 
structures within markets. PCA is an established tool in 
data analysis for generating lower-dimensional represen- 
tations of multivariate data Q and has provided useful 
insights in a diverse range of fields [a-tllj. In finance, 



PCA has been used to identify common factors in inter- 
national bond returns [THHH and produce market indices 
[l4| . It has also been used in subjects such as arbitrage 
pricing theory [l6[ and portfolio theory [l?} . 

PCA is closely linked to random matrix theory (RMT), 
which was developed to deal with the statistics of the 
energy levels of many-body quantum systems [H, [l9| . 
The standard financial application of RMT is to com- 
pare the eigenvalues and eigenvectors of correlation ma- 
trices of asset returns with the corresponding properties 
of correlation matrices for randomly distributed returns 
P. [20l . [2lll43 - l4r| . In prior studies, most of the eigenvalues 
of market correlation matrices were found to lie within 
the ranges predicted by RMT (for example, in Ref. Q, 
94% of the eigenvalues lie within the RMT range), which 
is usually taken as an indication that to a large extent 
the correlation matrix is random and dominated by noise 
[H US HH] • I n addition, the smallest eigenvalues of market 
correlation matrices were found to be most sensitive to 
noise in the estimation of the correlation coefficients Q ■ 
Because the eigenvectors corresponding to the smallest 
eigenvalues are used to determine the least risky portfo- 
lios in Markowitz portfolio theory [22j , this result has im- 
portant implications for risk management id, MM- 

Most prior investigations of markets using PCA and 
RMT focused on specific markets. For example, there 
is a large body of work investigating equity markets 
sin l2ll [23| , and there have also been investigations of 
emerging market equities [24T - [26j . the foreign exchange 
(FX) market [27|, and bond markets [U, El- Many of 
these studies only consider a single correlation matrix, 
although some also investigate the temporal evolution of 
correlations 0, EE| . The work in this paper differs from 
these studies by investigating a diverse range of asset 
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classes and by studying the time evolution of the corre- 
lations between these assets. By studying the time dy- 
namics of the correlations of many different markets, we 
uncover periods during which there were major changes 
in the correlation structure of the global financial system. 
We find that there was a large increase in the strength of 
correlations between many different assets following the 
2007-2008 credit crisis 28j, which has important impli- 
cations for the robustness of financial markets [29j . Fi- 
nancial institutions are linked both through credit rela- 
tionships and as a result of holding similar portfolios of 
assets [301. ^ many assets are correlated and prices fall, 
this can cause several financial institutions to write down 
the value of their assets. These write-downs can then 
impact the credit relationships between different institu- 
tions [3l|. The strength of correlations therefore affects 
the stability of markets. 

The rest of this paper is organized as follows. In Sec- 
tion UH we discuss our data. We then discuss correlations 
between assets in Section Hill PCA and RMT in Section 
IIV1 the temporal evolution of assets in Section [V] cor- 
relations between individual assets and components in 
Section I VII and individual asset classes in Section IVIII 
We summarize our results in Section IVIII1 We enumer- 
ate the assets that we considered in Appendix provide 
examples of the changes in the correlations between as- 
sets and principal components (PCs) in Appendix IB! and 
consider the contribution of the assets to these correla- 
tions in Appendix [Ul 



II. DATA 

We examine time series for N = 98 financial products 
for the period 01/08/1999-01/01/2010. These products 
include 25 developed market equity indices, 3 emerging 
market equity indices, 4 corporate bond indices, 20 gov- 
ernment bond indices, 15 currencies, 9 metals, 4 fuel com- 
modities, and 18 other commodities. (We enumerate and 
describe these assets in Table Q] of the appendix.) We in- 
clude markets from several geographical regions, so many 
are traded during different hours of the day. For exam- 
ple, stocks included in the Nikkei 225 are traded on the 
Tokyo Stock Exchange, which operates from midnight to 
6 AM GMT, whereas stocks included in the FTSE 100 
index are traded on the London Stock Exchange, which 
operates from 8 AM to 4:30 PM GMT. We use weekly 
price data to minimize effects that might result from the 
different trading hours for markets from different time 
zones. 
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We define a standardized return as Zi(t) = [zi(t) — 

(zi)]/a(zi), where cr(zi) = \J (zf) — (zi) 2 is the standard 
deviation of Zi over a time window of T time steps and 
(• ■ ■ ) denotes a time average over the same time window. 
We then represent the standardized returns as an N x T 
matrix Z, so the empirical correlation matrix is 



1 ~ a rp 
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which has elements r(i,j) <G [—1,1]. Because we have 
standardized the time series, the correlation matrix R of 
returns Z is equal to the covariance matrix of Z. 

We create a time-evolving sequence of correlation ma- 
trices by rolling the time window of T returns through the 
full data set. The choice of T is a compromise between 
overly noisy and overly smoothed correlation coefficients 
[HJ and is usually chosen such that Q = T/N > 1. Fig- 
ure [1] shows that we identify the same major changes in 
the distribution of correlation coefficients using different 
values of T; however, some of the features in the correla- 
tions are smoothed out for longer windows. In this study, 
we fix T = 100 (each window then contains just under 
two years of data and Q = 1.02), and we roll the time 
window through the data one week at a time. By only 
shifting the time window by one data point, there is a 
significant overlap in the data contained in consecutive 
windows. However, this approach enables us to track the 
evolution of the market correlations and to identify time 
steps at which there were significant changes in the corre- 
lations. The choice of T = 100 results in 475 correlation 
matrices for the period 1999-2010. 




2002 2004 2006 2008 2010 2002 2004 2006 2008 2010 
T= 100 T= 150 T = 200 



III. CORRELATIONS 



We denote the price of asset % at discrete time t as Pi (t) 
(i = l,...,iV) and define a weekly logarithmic return 



FIG. 1: (Color online) Comparison of the distribution of cor- 
relation coefficients as a function of time for time windows of 
length T = 100, 150, and 200. (a) Mean correlation fi(r), (b) 
standard deviation u(r), (c) skewness y(r), and (d) kurtosis 
ft(r). 
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In Fig. [2l we show the distribution of all of the em- 
pirical correlation coefficients from every time window. 
To highlight interesting features in the correlations, we 
compare the distribution to corresponding distributions 
for simulated random returns and randomly shuffled re- 
turns. We generate shuffled data by randomly reordering 
the full return time series for each asset independently 
This process destroys the temporal correlations between 
the return time series but preserves the distribution of 
returns for each series. We then produce correlation ma- 
trices for the shuffled returns by rolling a time window 
of T time steps through the shuffled data and calculat- 
ing a correlation matrix for each position of the win- 
dow. We produce simulated data by independently gen- 
erating N time series of returns (where each series has 
the same length as the original data) whose elements are 
drawn from a Gaussian distribution with mean zero and 
unit variance. We again roll a time window of length T 
through the data and calculate a correlation matrix for 
each window [5lj |. 

Figure [2] illustrates that the distribution of correlation 
coefficients for the market data is significantly different 
from the two random distributions as the market data has 
more large positive and negative correlations. The differ- 
ences between the distributions demonstrate that there 
are temporal correlations between returns for financial 
assets that are incompatible with the null models that 
we consider, which in turn implies that financial mar- 
ket correlation matrices contain structure that warrants 
investigation. 
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FIG. 2: (Color online) Distribution of all of the correlation 
coefficients r(i,j) from every time window for market and 
random data. The shuffled and simulated data lie almost on 
top of each other. 



transformation O that maps a set of observed variables 
Z into a set of uncorrelated variables Y Q. We define 
the N x T matrix 



Y = ftZ 



(3) 



where each row (k = I, . . . , N) corresponds to a PC 
of Z and the transformation matrix f2 has elements ujki- 
The first row ui of f2 (which contains the first set of 
PC coefficients) is chosen such that the first PC yi is 
aligned with the direction of maximal variance in the N- 
dimensional space defined by Z. Each subsequent PC 
accounts for as much of the remaining variance of Z as 
possible, subject to the constraint that the Wk arc mu- 
tually orthogonal. We further constrain the vectors u>k 
such that uikUfr = 1 for all k. 

The correlation matrix R is an N x N diagonalizable, 
symmetric matrix that can be written in the form 



R = — EDE T 

T 



(4) 



where D is a diagonal matrix of eigenvalues and E is 
an orthogonal matrix of the corresponding eigenvectors. 
It is known Q that the eigenvectors of the correlation 
matrix correspond to the directions of maximal variance 
such that f2 = E T , and one finds the PCs via the diago- 
nalization in Eq. The signs of the PCs are arbitrary; 
if the sign of every coefficient in a component y^ is re- 
versed, neither the variance of y^ nor the orthogonality of 
u>k with respect to each of the other eigenvectors changes. 

We compare the properties of the market correlation 
matrices with correlation matrices for random time se- 
ries. The correlation matrix for N mutually uncorre- 
lated time series of length T with elements drawn from 
a Gaussian distribution is a Wishart matrix [H, H(J . In 
the limit N — > oo, T — > oo, and with the constraint that 
Q = T/N > 1, the probability density function p(j) of 
the eigenvalues 7 of such correlation matrices is given by 



/ \ Q Vh+ -7) (7- -7) ft .s 
pvy) = - — ^7 ( 5 ) 



27T<7 2 (Z) 



7 



where cr 2 (Z) denotes the variance of the elements of Z, 
and 7+ and 7^ are the maximum and minimum eigen- 
values of the matrix, which are given by 



7± 



(Z) 1 



1 



±2 



(6) 



IV. PRINCIPAL COMPONENT ANALYSIS 
AND RANDOM MATRIX THEORY 

We investigate the structure of the correlation matri- 
ces using PCA. The aim of PCA is to find the linear 



When Q = 1, the minimum eigenvalue is 7_ = 0, the 
maximum eigenvalue is 7+ = 4cr 2 (Z), and the density 
p(j) diverges as l/v/7 as 7 — >• 7_ =0. Equations (|5|6j) 
are only valid in the limit N — > 00. For finite N, there is a 
non-zero probability of finding eigenvalues larger than 7+ 
and smaller than 7_. For the returns that we investigate, 
cr 2 (Z) = I and 7+ = 3.96. 
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In Fig.[3l we compare the distribution of all eigenvalues 
from every time window for market data with the distri- 
butions for shuffled and simulated data. In panel (a), 
we show that the eigenvalue distribution for market cor- 
relations differs from that of random matrices: there are 
many eigenvalues larger than the upper bound 7+ = 3.96 
predicted by RMT (with several eigenvalues almost 10 
times as large as the upper bound). In prior studies of eq- 
uity markets, the eigenvector corresponding to the largest 
eigenvalue has been described as a "market" component, 
with roughly equal contributions from each of the N eq- 
uities studied, and the eigenvectors corresponding to the 
other eigenvalues larger than 7+ have been identified as 
different market sectors [lUl. In Section ED we dis- 
cuss the interpretation of the observed eigenvectors with 
eigenvalues j3 > 7+. For now, we simply note that the de- 
viations of the empirical distribution of eigenvalues from 
the predictions of RMT again imply that the correlation 
matrices contain structure that is incompatible with the 
null models that we consider. 

In Figs. [3^b) and (c), we illustrate that the distribu- 
tions for shuffled and simulated data are very similar and 
that they agree very well with the analytical distribution 
given by Eq. ([5]) over most of the range of 7. In particu- 
lar, both distributions have an upper bound close to the 
theoretical maximum 7+ = 3.96. However, for Q = 1.02 
(the value that corresponds to the selected T and N), the 
observed distribution of eigenvalues for random data docs 
not fit the distribution in Eq. J5]) as 7 — > 0. For both the 
simulated and shuffled data, we observe a much higher 
probability density near 7 = than that predicted by 
RMT. The high probability density near zero is a result 
of the fact that T w N. When we simulate eigenvalue 
distributions for data with T 3> N, we observe a much 
smaller probability density near zero. In Figs. EJb) and 
(c), we also show the theoretical distribution for Q = 1. 
In this case ^0(7) diverges as 7 — > 0, which fits the ran- 
domly generated distributions reasonably well. 
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FIG. 3: (Color online) Distribution of eigenvalues p(/3) of the 
correlation matrices for all time windows for (a) market, (b) 
shuffled, and (c) simulated data. The insets show the distri- 
butions of the largest eigenvalues. In (b) and (c), we show 
the eigenvalue probability density functions p{^) for random 
matrices given by Eq. ((5| for Q = 1.02 and Q — 1. 



We obtain similar results for the distributions of the 
elements u>ki (the PC coefficients) of the eigenvectors 
of the correlation matrices. Correlation matrices R arc 
real symmetric matrices, so we compare the eigenvector 
properties of the matrices R with those for real symmet- 
ric random matrices. Such random matrices display the 
universal properties of the canonical ensemble of matri- 
ces known as the Gaussian orthogonal ensemble (GOE) 
[13, Ull- For the GOE, the probability density p(u k ) of 
the elements of the k th eigenvector is a Gaussian distribu- 
tion with mean zero and unit variance [HJ • We find that 
the distributions for shuffled and simulated data closely 
match a Gaussian distribution, but there are differences 
between these distributions and the distributions for mar- 
ket correlations. These differences arc most pronounced 
for the first and second PCs. In particular, there are 
asymmetries in the distributions for market data that 
are not present in the random distributions. 



V. TEMPORAL EVOLUTION 

In prior sections, we studied aggregate results for all 
time steps and illustrated that the eigenvalues and eigen- 
vectors of financial market correlation matrices suggest 
that there are correlations that are incompatible with the 
random null models that we consider. We now investigate 
the temporal evolution of financial market correlations by 
investigating changes in the eigenvalues and eigenvectors. 

A. Proportion of Variance 

We begin by considering the eigenvalues of the corre- 
lation matrices. The covariance matrix Sy for the PC 
matrix Y can be written as 

s Y = 1yy t = ±nzz T n T = d , (7) 
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where D is the diagonal matrix of eigenvalues j3. The 
total variance of the returns Z for the N assets is then 

N N N 

J2 = tr(E 4 ) = 5>=E = tr ( D ) = N > 

i—1 i—1 i—1 

(8) 

where is the covariance matrix for Z and cr 2 (z;) = 1 
is the variance of the vector of returns for asset i. The 
proportion of the total variance in Z explained by the k th 
PC is then 

o- 2 (yfc) = Pk = Pk (Q) 

In words, the ratio of the fc th largest eigenvalue (3k to 
the number of assets ./V is equal to the proportion of the 
variance from the fc th PC. 

In Fig. |4l we show as a function of time the fraction of 
the variance /3k/N due to the first five PCs (k = 1, . . . , 5). 
From 2001 to 2004, the fraction of the variance explained 
by the first PC increased. Between 2004 and 2006, it 
decreased before gradually increasing again. In par- 
ticular, a sharp rise occurred when the week including 
09/15/2008 entered the rolling time window. This was 
the day that Lehman Brothers filed for bankruptcy and 
Merrill Lynch agreed to be taken over by Bank of Amer- 
ica. Both events represented major shocks to the finan- 
cial system [28| . The variance explained by the first PC 
peaked as the week ending 12/05/2008 entered the rolling 
window — this was the week during which the National 
Bureau of Economic Research officially declared that the 
U.S. was in a recession — at which point it accounted for 
nearly 40% of the variance in Z. 

The large variance in market returns explained by a 
single component implies that there is a large amount 
of common variation in financial markets. The increase 
in the variance accounted by the first PC between 2001 
and 2010 also suggests that markets have become more 
correlated in recent years. In particular, the significant 
rise in the variance of the first PC following the collapse 
of Lehman Brothers demonstrates that markets became 
more correlated during the period of crisis following the 
failure of this major bank. 

Although the changes in the variance from other high 
PCs are smaller than those for the first PC, from 2001 
until the collapse of Lehman Brothers, the variance ex- 
plained by the second and third PCs appears to be nega- 
tively correlated with the variance explained by the first 
PC. This is to be expected because the total variance is 
constrained to sum to N [see Eq. ((SJ)], so when the first 
PC accounts for a higher proportion, less remains to be 
explained by the other components. Following Lehman's 
bankruptcy, there is a sharp increase in the variance ex- 
plained by the first PC, for which the small decreases in 
the variances of the next four PCs do not account. In- 
stead, the increase in the variance of the first PC arises 
from small decreases in the variance of many other PCs. 
From 09/19/2008 to 12/05/2008, the variance explained 



by the first PC increased by 10%. Over the same period, 
the variance explained by only four other PCs increased 
(with a maximum increase of 0.02% for the ninth PC). 
The variances due to all other PCs decreased. For ex- 
ample, the second, third, fourth, and fifth PCs fell by 
0.6%, 1.3%, 0.7%, and 0.5%, respectively. The combined 
decrease in these PCs offset the sharp rise in the first PC. 

It is also instructive to consider the combined variance 
explained by the first few PCs. In 2001, the first twelve 
PCs accounted for approximately 65% of the variance of 
market returns. By 2010, however, the first five PCs ac- 
counted for the same proportion. The fact that only a 
few components accounted for such a large proportion of 
the variance implies that there is a lot of common vari- 
ation in the return time series of many financial assets 
and highlights the close ties between different markets. 
It also suggests that market correlations can be charac- 
terized by much fewer than N components. 




FIG. 4: (Color online) Fraction of the variance in Z explained 
by the first five PCs versus time. The top curve shows the 
variance explained by the first PC, the next curve shows the 
variance explained by the second PC, and so on. The hori- 
zontal axis shows the year of the last data point in each time 
window. 



B. Significant Principal Component Coefficients 

An increase in the variance for which a PC accounts 
might be the result of increases in the correlations among 
only a few assets (which then have large PC coefficients) 
or a market-wide effect in which many assets begin to 
make significant contributions to the component. This 
is an important distinction, because the two types of 
changes have very different financial implications. For 
example, in optimal portfolio selection, it becomes much 
more difficult to reduce risk by diversifying across dif- 
ferent asset classes when correlations between all assets 
increase. In contrast, increases in correlations within an 
asset class that are not accompanied by increases in cor- 
relations between asset classes have a less significant im- 
pact on diversification. A market-wide increase in corre- 
lations might also imply a change in the global macroe- 



6 



conomic environment. If many assets are correlated, this 
suggests that the same macroeconomic force is driving 
different markets. 

We use the inverse participation ratio (IPR) [2ll . |34| 
to investigate temporal changes in the number of assets 
that make significant contributions to each component. 
The IPR I k of the k th PC u k is defined as 

N 

I fe = 5>^ 4 - ( 10 ) 

i=l 

The IPR quantifies the reciprocal of the number of ele- 
ments that make a significant contribution to each eigen- 
vector. Two limiting cases help one to understand it: (1) 
an eigenvector with identical contributions u^i = 1/VA 
from all N assets has I k = 1/N; and (2) an eigenvector 
with a single component = 1 and remaining compo- 
nents equal to zero has I k = 1. We also define a partici- 
pation ratio (PR) as 1/Ik- A large PR for a PC indicates 
that many assets contribute to it. 

In Fig. [SJ we show as a function of time the PR of 
the first three PCs. The PR of the first PC increased 
from 2001 to 2010, and there were sharp increases when 
the weeks ending 05/12/2006 and 09/19/2008 entered the 
rolling time window. The second increase coincided with 
the market turmoil that followed the collapse of Lehman 
Brothers and occurred at the same time as a significant 
increase in the variance that was explained by the first 
PC (see Fig. 0]). The first increase coincided with surg- 
ing metal prices. During the week ending 05/12/2006, 
the price of gold rose to a 25-year high, reaching over 
$700 per ounce, and the prices of several other metals 
also rose to record levels. Platinum and copper reached 
all time highs, aluminum hit an 18-year peak, and silver 
prices rose to their highest levels since February 1998. 
During the same week, corporate bond prices reached a 
2-year high and the prices of emerging market equities 
reached record levels. Although these events coincided 
with a significant increase in the PR of the first PC, this 
increase was not accompanied by a sharp rise in the vari- 
ance explained by this component. 

The sharp rise in the PR of the first PC following the 
collapse of Lehman Brothers implies that many assets 
were highly correlated during the ensuing financial crisis. 
Based on the value of the PR, over 70% of the studied 
assets contributed significantly to the first PC. To test 
the significance of the PR of the first PC, we compare it 
to the corresponding PR for random returns. Figure [5] 
illustrates that between 2006 and 2010, the PR of the ob- 
served returns was significantly larger than that expected 
for random returns, which emphasizes the large number 
of different assets that were correlated during this period. 

The temporal evolutions of the PRs of the higher com- 
ponents are rather different. For example, from 2001 to 
2003, the PR of the second PC doubled; it then fluctuated 
around the same level until the collapse of Lehman Broth- 
ers, at which point it decreased sharply. Similarly, the 
PR of the third PC increased from 2001 until Lehman's 
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FIG. 5: (Color online) Participation ratio [Ik]~ as a func- 
tion of time for the three PCs with the largest variance 
(k = 1,2,3). The horizontal solid line shows the PR (aver- 
aged over 100,000 simulations) of the first PC for randomized 
returns, and the horizontal dashed lines show one standard 
deviation above and below the mean. 



collapse, and then it also fell sharply. This suggests that, 
following Lehman's bankruptcy, the first PC influenced 
many assets at the expense of higher components. The 
dominance of a single PC again implies that there is a 
large amount of common variance in asset returns. It 
also suggests that the key market correlations can be de- 
scribed using only a few PCs. 

From 2001 to 2002, the PR of each of the first three 
PCs was below the value expected for random returns for 
all but 9 weeks. For random returns, there are only small 
differences in the PRs for the different PCs. For example, 
using 100,000 simulations we find that the mean PR of 
the first, second, and third PCs are 38.3, 37.7, and 37.3, 
respectively. The standard deviations are 4.0, 4.1, and 
4.2, respectively. This implies that eigenvectors for cor- 
relation matrices of random returns are extended as many 
different assets contribute to them [2l|. In contrast, from 
2001 to 2002, the eigenvectors for the correlation matri- 
ces for market data are localized and have fewer assets 
contributing to them than expected for correlation ma- 
trices for random uncorrclatcd returns. 

Inspection of the first three eigenvectors over this pe- 
riod suggests that they correspond to bonds, equities, 
and currencies, and the PRs of the first three PCs sup- 
port this observation. We study 24 bond indices (govern- 
ment and corporate), 28 equity indices, and 15 curren- 
cies. At the beginning of 2001 the PRs of these first three 
PCs were 32.6, 40.3, and 14.9, respectively. As expected, 
these PRs are larger than the number of bonds, equities, 
and currencies that we study. The different asset classes 
have some common variance with other types of assets, 
which inflates the PR. Taking this effect into account, 
the PRs are consistent with localized eigenvectors that 
represent specific asset classes. 

From 2001 to 2010, the first PC changed from a local- 
ized state in which only bonds contributed significantly 
to an extended state in which nearly all of the assets that 
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we consider contributed. We discuss this in more detail in 
Section[VU The large PR during the post-Lehman period 
and the small PR from 2001 to 2002 are both indicative 
of correlations that are incompatible with uncorrelated 
asset returns. 



C. Number of Significant Components 

We now attempt to determine how many PCs are 
needed to describe the primary market correlations. PCA 
is widely used to produce lower-dimensional represen- 
tations of multivariate data by retaining a few "signif- 
icant" components and discarding all other components 
0. Many heuristic methods have been proposed for de- 
termining the number of significant PCs, but there is no 
widespread agreement on an optimal approach [35j . 

We apply two techniques to find the number of sig- 
nificant components. The first is the Kaiser-Guttman 
criterion [36j |. which assumes that a PC is significant if 
its eigenvalue j3 > 1/N. Any component that satisfies 
this criterion accounts for more than a fraction (1/N) of 
the variance of the system. It is considered significant 
because it is assumed to summarize more information 
than any single original variable. The second approach 
is to compare the observed eigenvalues to the eigenval- 
ues for random data and can be understood by consid- 
ering the scree plot in Fig. EJa). A scree plot shows the 
magnitudes of the eigenvalues as a function of the eigen- 
value index, where the eigenvalues are sorted such that 
Pi > 02 > • • • > Pn- The leftmost data point in a scree 
plot indicates the magnitude of the largest eigenvalue, 
and the rightmost data point indicates the magnitude of 
the smallest eigenvalue. The number of significant PCs 
is given by the number of eigenvalues in the scree plot for 
which the eigenvalue for the observed data is larger than 
the corresponding eigenvalue for random data. For ex- 
ample, in Fig.^a), there are 3 significant eigenvalues for 
11/27/2009 and 6 significant eigenvalues for 03/09/2001. 

Figures IDJb) and (c) illustrate that there are large dif- 
ferences in the number of significant components iden- 
tified using the two techniques, though both agree that 
the number decreased between 2001 and 2010. The dis- 
crepancies in the results suggest that one cannot reli- 
ably determine the exact number of significant PCs using 
these two methods. Nevertheless, the similar trends ob- 
tained using the two techniques provide evidence that the 
number of significant components decreased from 2001 
to 2010. This again implies that markets have become 
more correlated in recent years. Both methods also agree 
that the number of significant components is much lower 
than the number of assets that we considered. Therefore, 
although one cannot determine precisely the number of 
significant components using the methods described in 
this section, our results nonetheless suggest that market 
correlations can be characterized by much fewer than N 
components. 
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FIG. 6: (Color online) Panel (a) shows a scree plot, which 
gives the magnitude of the PC eigenvalues as a function of the 
eigenvalue index, where the eigenvalues are sorted such that 
Pi > /?2 > . . . ~> Pn ■ We show curves for random correlation 
matrices and for correlation matrices for time windows ending 
on 03/09/2001 and 11/27/2009. The inset zooms in on the 
region in which the two example curves for observed data 
cross the curve for random data. Panels (b) and (c) show 
the number of significant components as a function of time 
determined using (b) the Kaiser-Guttman criterion and (c) 
by comparing the scree plots of the observed and random 
data. 



VI. CORRELATIONS BETWEENS ASSETS 
AND COMPONENTS 

We now return to the question of the interpretation of 
the eigenvectors with eigenvalues j3 larger than the up- 
per bound 7+ predicted by RMT. To do this, we inves- 
tigate the correlations r(zj,yfc) between the asset return 
time series Zj and the PCs y^. These correlations are 
closely related to the PC coefficients, which represent the 
weighting of each asset on the PCs. However, because the 
correlations r(zj,yfc) are confined to the interval [—1, 1], 
they arc easier to interpret than the PC coefficients. We 
use the correlations r(zi, y&) to measure the strengths of 
the asset-PC relationships and to determine which assets 
contribute to each PC. In doing this, we also determine 
the number of PCs that need to be retained to describe 
the main features of the correlation matrices. 

We write the covariance matrix of the return Z with 
PCs Y as 

£ w = -YZ T = -fJZZ T = Ofi T Dfi = DO . (11) 
yz T T 

This implies that the covariance of the returns of asset i 
and the k th PC is given by £(z,,y/-) = UkiPk- Addition- 
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ally, the correlation r(zj,yfc) is given by 



a(zi)a(y k ) 



(12) 



where o-(ij) = 1 is the standard deviation of Zj over T 
returns and cr(yfc) = The correlations between the 

PCs and the original variables are therefore equal to the 
PC coefficients scaled by the appropriate eigenvalue. The 
signs of the PC coefficients are arbitrary, so the signs of 
the PCs and the signs of the correlations r(zi,y^) arc 
also arbitrary. To avoid having to choose a sign for each 
correlation coefficient, we consider absolute correlations 
|r(zj,yfc)|. Although we can then no longer tell whether 
an asset is positively or negatively correlated with a PC, 
this step is reasonable because we are interested only in 
determining which assets contribute to each component. 



A. Assets Correlated with Each Component 

In Fig. we show the variation through time of the 
correlation of every asset with each of the first six PCs. 
(In Appendix [B] we show example plots of |r(zj,yfc)| as a 
function of time for specific assets.) This figure highlights 
that there are significantly fewer high correlations for the 
components with larger k. For example, many of the 
correlations in the first PC are greater than 0.8, but the 
correlations between the asset returns and the sixth PC 
rarely exceed 0.5. As one considers increasingly higher 
components, the maximum correlation decreases until all 
correlations are less than 0.2 for the highest components. 
The low correlations between the asset return time series 
and the higher PCs implies that much of the key structure 
from the correlation matrices is contained in the first few 
PCs. Because the assets contribute to the PCs, some of 
the correlation |r(z,,yfc)| between the i th asset and the 
fc th PC is attributable to asset i. We discuss the effect of 
these "self-correlations" in Appendix [Cl 

Figure [7J demonstrates the changing correlations be- 
tween the different asset classes. From 2001 to 2002, all 
of the corporate and government bonds (except Japanese 
government bonds) were strongly correlated with the first 
PC. Over the same period, most of the equity indices were 
strongly correlated with the second PC and most of the 
currencies were strongly correlated with the third PC; 
six grain commodities (soybean, soybean meal, soybean 
oil, corn, wheat, and oats) were strongly correlated with 
the fourth PC; and fuel commodities were strongly corre- 
lated with the fifth PC. Therefore, over this period, each 
of the first five PCs corresponded to a specific market, 
and the separation into components gave low correlations 
between different assets classes. During 2002, however, 
these relationships began to break down as bonds and 
equities both became strongly correlated with the first 
PC and both types of asset had a correlation of approxi- 
mately 0.5 with the second PC. The strong correlation of 
both bonds and equities with the same PCs marked the 



start of a period during which the coupling between as- 
set classes increased and different markets became more 
closely related. 

We found three major changes in the correlations be- 
tween 2002 and 2009; these are most clearly seen by ex- 
amining the second PC in Fig. [JJ The first change cor- 
responded to a local peak in corporate bond prices; the 
second change corresponded to surging metal prices; and 
the third (and most striking) change occurred following 
the collapse of Lehman Brothers. After they declared 
bankruptcy, the first PC became strongly correlated with 
nearly all assets — including equities, currencies, metals, 
fuels, other commodities, and some government bonds. 
The major exceptions were corporate bonds and (to a 
lesser extent) government bonds, but both sets of bonds 
were strongly correlated with the second PC. During this 
period, only a few assets were strongly correlated with 
the third PC; these included EURUSD, CHFUSD, gold, 
silver, and platinum. Additionally, very few assets were 
strongly correlated with the higher PCs. The strong cor- 
relations between the majority of the studied assets and 
the first PC following Lehman Brothers' collapse further 
demonstrates the strength of market correlations during 
this crisis period and highlights the common behavior of 
nearly all markets. It also suggests that many different 
markets are being driven by the same macroeconomic 
forces. 

Figure [Jj also illustrates that for a system in which 
the first few PCs account for a significant proportion of 
the variance, a consideration of the correlations between 
these components and the original variables provides a 
parsimonious framework to uncover the key relationships 
in the system. Instead of having to identify important 
correlations in a matrix with ^N(N — 1) elements, one 
only needs to consider correlations between the N vari- 
ables and the first few PCs, which reduces the number 
of correlations to consider by a factor of N. Figure [Jj 
demonstrates that this method uncovers the changing re- 
lationships between the different asset classes and high- 
lights assets, such as Japanese government bonds, whose 
behavior was unusual. As we have discussed above, this 
approach also uncovers notable changes that occurred in 
markets as well as the assets that were significantly af- 
fected by these changes. 



B. Financial Factor Models 

Several models have been proposed that attempt to ex- 
plain return time series using linear combinations of one 
or more financial market factors (38l . |39| . Some of these 
models are closely related to PCA, so we discuss them 
briefly. One of the most widely studied factor models is 
the Capital Asset Pricing Model (CAPM) JMIU, which 
relates the return of equities to the returns of a single 
factor-the "market portfolio" -which is usually taken as 
the return of a market-wide index like the S&P 500 [3!| . 
Empirical evidence indicates, however, that the CAPM 




FIG. 7: (Color online) The absolute correlation \r(zi, yk)\ between each asset and the first six PCs (k = 1, . . . , 6) as a function 
of time. Each point on the horizontal axis represents a single time window, and each position along the vertical axis represents 
an asset. 



does not explain the behavior of all asset returns [39(. 
This implies that other factors might be needed to ex- 
plain fully return time series, which has led to the de- 
velopment of models with multiple factors. One general 
model is the Arbitrage Pricing Theory (APT) [42[ , which 
provides an approximate model of expected asset returns 
using an unknown number of unidentified factors. The 
problem then becomes to identify the factors. 

The approaches for identifying factors fall into two 
basic categories: statistical and theoretical. The the- 
oretical models are based on specifying macroeconomic 
variables (such as gross national product or changes in 
bond yields [IH) or firm-specific variables (such as mar- 
ket capitalization [38| ) as factors. Relationships between 
the variables and the return time series are then often 
determined using linear regression. Statistical methods 
make no assumptions about which variables correspond 
to which factors and instead identify the factors directly 
from the return time series. Two commonly used statis- 
tical methods are factor analysis and PC A (38l. |39|. 

Many prior studies that use PCA to find factors focus 
on equity markets [HI, [39| . In this approach, the return 
time series are used to construct portfolios that represent 
factors. The PCs define different portfolios in which the 
weights of the assets are based on the PC coefficients. 



One of the issues in defining factors in this way is that 
the factors strongly depend on the time period that the 
return time series cover [3^] . 

In contrast to many factor models, we consider dif- 
ferent types of asset instead of focusing on a particular 
asset class. This does not prevent us from using PCA 
to identify portfolios of assets that represent the factors. 
However, as we showed in Fig. [7] and discussed in Sec- 
tion IVI Al wc found significant variations in the assets 
that contributed to each PC during different time peri- 
ods. For example, from 2001-2002, the first PC corre- 
sponded to bonds, whereas a wide range of different as- 
sets contributed to this component following the collapse 
of Lehman Brothers. These variations would therefore re- 
sult in significant changes in the portfolios corresponding 
to each factor, implying that the PCs do not represent 
the same factors through time. Consequently, there is 
no simple interpretation of the PCs as specific financial 
factors. 



VII. INDIVIDUAL ASSET CLASSES 

Finally, we repeat some of our analysis using correla- 
tion matrices that only include similar types of assets. 
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For some classes, we possess only a few time series, so 
we combine asset classes. We consider equities, curren- 
cies, bonds (government and corporate), and commodi- 
ties (which includes all assets categorized as metals, fuels, 
and commodities in Table IJ). 

A. Fraction of Variance for Asset Classes 

In Fig. [51 we show as a function of time the fraction 
of the variance explained by the first PC for correlation 
matrices that only include one type of asset. The changes 
in the variances for equities, commodities, and currencies 
are similar to those that we observed in Fig. [4] for all 
assets. In particular, the variance explained by the first 
PC increased from 2001 to 2010, and there was a sharp 
rise in September 2008 following the collapse of Lehman 
Brothers. In contrast, the variance in bond returns for 
which the first PC accounted decreased over the same 
period. From 2001 to 2010, the variance explained by the 
first five PCs increased from 68.6% to 89.6% for equities, 
from 44.8% to 60.3% for commodities, and from 76.1% 
to 82.1% for currencies, but it decreased from 92.4% to 
86.7% for bonds. 

The decrease in the variance of bond returns explained 
by the first PC following the collapse of Lehman Broth- 
ers implies that bonds became less correlated during this 
period. To explain this change, we consider the corre- 
lation between government bonds for the following Eu- 
ropean countries: Finland, Ireland, Greece, the Nether- 
lands, France, Austria, Belgium, Portugal, Spain, Italy, 
and Denmark. All of these countries except for Den- 
mark use the Euro, and the value of the Danish krone 
is pegged to the Euro HI]. For the period 01/08/1999- 
09/12/2008, the mean correlation between these bonds 
was 0.98 and the standard deviation was 0.01. The high 
correlation implies that the return time series for the 
different countries' bonds were very similar prior to the 
crisis. For the period 09/19/2008-01/01/2010, however, 
the mean correlation fell to 0.83 and the standard devia- 
tion was 0.11. During the crisis, financial uncertainty in- 
creased and market participants became more concerned 
about the higher default risk for particular countries [4!| . 
These concerns were heightened by the downgrading of 
the sovereign debt rating of Greece, Spain, and Portugal, 
and a flight to safer bonds, such as German bunds, in- 
creased variations in the behavior of different bonds [4!| . 
The increased variations resulted in lower correlations 
between bond returns and a reduction in the variance 
explained by the first PC. 



B. Correlations Between Assets and PCs 

We also calculate the correlations between assets and 
PCs for each asset class to determine which assets con- 
tributed to each PC. For equities, nearly all of the in- 
dices were strongly correlated with the first PC over 
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FIG. 8: (Color online) Fraction of the variance in the returns 
Z for individual asset classes explained by the first PC versus 
time. The horizontal axis shows the year of the last data point 
in each time window. 



the full period. The exceptions were the Nikkei, New 
Zealand All Ordinaries Index, Austrian Traded Index, 
and Athens General Index. However, all of these indices 
were strongly correlated with the first PC after Lehman's 
bankruptcy. The strong contribution of nearly all equi- 
ties to the first PC is consistent with prior studies in 
which this eigenvector was identified as a market factor 
that affected all stocks [J, [2(| Hl[. During some time 
periods, the second PC corresponded to a group of pe- 
ripheral Euro-zone countries (including Switzerland, Por- 
tugal, Belgium, Ireland, and Austria), whereas it corre- 
sponded to emerging market equities during other peri- 
ods. Similarly, the third PC was correlated with differ- 
ent groups of indices during different time periods. The 
higher PCs tended to be strongly correlated with sin- 
gle indices, which implies that they represented country- 
specific factors. There were frequent changes in the in- 
dices correlated with the higher PCs. This is the result 
of changes in the variance of the index returns for the 
different assets, which affect the ordering of the PCs. 

From 2001 to 2010 all of the bond indices except 
Japanese government bonds were strongly correlated 
with the first PC. During most of this period the sec- 
ond PC was correlated with corporate bonds. The assets 
that contributed to the third and fourth PCs changed 
through time. From 2001-2002 and from 2004-2010, the 
third PC corresponded to New Zealand and Australian 
government bonds and the fourth PC corresponded to 
Japanese government bonds. From 2002-2004, Japanese 
bonds were strongly correlated with the third PC and 
New Zealand and Australian bonds were strongly corre- 
lated with the fourth PC. The localization of these eigen- 
vectors implies that the PCs represented asset-specific 
factors, and the change in the identity of the two PCs is 
again the result of the changing variances of the return 
time series. As with equities, the higher PCs tended to 
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be correlated with single assets. 

The correlations between assets and PCs for commodi- 
ties and currencies were similar to those shown in Fig. [7] 
for the correlation matrices of all assets. From 2001-2002 
the first few PCs for the commodities were correlated 
with particular types of assets. For example, grain com- 
modities were correlated with the first PC and heating 
oil, crude oil, and base metals (copper, aluminum, lead, 
nickel, and tin) were correlated with the second PC. By 
2010, however, nearly all of the commodities (with the 
exception of orange juice, lumber, lean hogs, and pork 
bellies) were strongly correlated with the first PC, and 
few assets were strongly correlated with any of the other 
PCs. Similarly, during 2010, all of the currencies were 
strongly correlated with the first PC, with the exception 
of the "safe haven" currencies JPY and CHF 37 1 . 

For all of the asset classes, the low correlations be- 
tween the asset return time series and the higher PCs 
again indicate that much of the key structure from the 
correlation matrices is contained in the first few PCs. 



VIII. SUMMARY 

We used principal component analysis to investigate 
the evolving correlation structure of financial markets 
and to study common features of different markets. We 
found that the percentage of the variance in market re- 
turns explained by the first principal component steadily 
increased starting in 2006, and that there was a sharp 
rise following the 2008 collapse of Lehman Brothers. We 
also found that the number of significant components de- 
creased and that the number of assets making significant 
contributions to the first PC increased over this period. 
The strength of the correlations across asset classes fol- 
lowing Lehman's bankruptcy suggests that many differ- 
ent markets were being driven by the same macroeco- 
nomic forces during the financial crisis. We also showed, 
however, that both the proportion of the variance ex- 
plained by the first principal component and the partici- 
pation ratio of this component increased starting in 2006, 
which implies that pairwise correlations between a vari- 
ety of different assets increased for several years before 



the crisis. It is conceivable that the steady increase in cor- 
relations from 2006 might be associated with the growing 
internationalization of financial markets over this period 
[oOj , though more research would be necessary to support 
such a conclusion. 

We also investigated the time-evolving relationships 
between the different assets by investigating the corre- 
lations between the asset price time series and the first 
few PCs. From 2001 to 2002, each of the first five PCs 
corresponded to a specific market. However, after 2002, 
these relationships broke down; by 2010, nearly all of the 
assets that we studied were significantly correlated with 
the first PC. We observed similar behavior for correlation 
matrices of individual asset classes. 
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Appendix A: List of Assets 

In Table HI we provide details of all of the assets that 
we study. We selected the assets so that the data in- 
cludes price time series for all of the major markets. The 
number of assets N that we include is limited by the 
constraint that Q = T/N > 1 and the fact that the cor- 
relation coefficients arc too smoothed if T is too large. 
Because of these constraints, we use indices for some mar- 
kets, instead of individual assets, in order to obtain an 
aggregate view of the market. For all of the commodities, 
we use futures contracts because commodities are most 
widely traded in the futures market. 



TABLE I: Enumeration and details of all of the assets that we study. 



Ticker 


Asset class 


Description 


Ticker 


Asset class 


Description 


AEX 


Equities 


Netherlands AEX Index 


CADUSD 


Currencies 


Canadian dollar 


AS30 


Equities 


Australian All Ordinaries Index 


CHFUSD 


Currencies 


Swiss franc 


ASE 


Equities 


Athens General Index 


CZKUSD 


Currencies 


Czech koruna 


ATX 


Equities 


Austrian Traded Index 


EURUSD 


Currencies 


Euro 


BEL20 


Equities 


Belgium BEL 20 Index 


GBPUSD 


Currencies 


Pounds sterling 


BVLX 


Equities 


Portugal PSI General Index 


IDRUSD 


Currencies 


Indonesian rupiah 


CAC 


Equities 


France CAC 40 Index 


JPYUSD 


Currencies 


Japanese yen 


DAX 


Equities 


Germany DAX Index 


KRWUSD 


Currencies 


Korean won 


FTSEMIB 


Equities 


Italy FTSE MIB Index 


MXNUSD 


Currencies 


Mexican peso 


HEX 


Equities 


Helsinki SX General Index 


NOKUSD 


Currencies 


Norwegian krone 


HSI 


Equities 


Hong Kong Hang Seng Index 


NZDUSD 


Currencies 


New Zealand dollar 


IBEX 


Equities 


Spain IBEX 35 Index 


PHPUSD 


Currencies 


Philippines peso 



continued on next page 



12 



Ticker 


Asset class 


Description 


Ticker 


Asset class 


Description 


INDU 


Equities 


Dow Jones Industrial Average Index 


SEKUSD 


Currencies 


Swedish krona 


ISEQ 


Equit 


ies 


Irish Overall Index 


ZARUSD 


Currencies 


South African rand 


KFX 


Equit 


ies 


OMX Copenhagen 20 Index 


HG1 


Metals 


Copper 


NDX 


Equit 


ies 


NASDAQ 100 Index 


LAI 


Metals 


Aluminum 


NKY 


Equit 


ies 


Nikkei 225 Index 


LL1 


Metals 


Lead 


NZSE 


Equit 


ies 


New Zealand All Ordinaries Index 


LN1 


Metals 


Nickel 


OBX 


Equit 


ies 


Norway OBX Stock Index 


LT1 


Metals 


Tin 


OMX 


Equit 


ies 


OMX Stockholm 30 Index 


XAG 


Metals 


Silver 


RTY 


Equit 


ies 


Russell 2000 Index 


XAU 


Metals 


Gold 


SMI 


Equit 


ies 


Swiss Market Index 


XPD 


Metals 


Palladium 


SPTSX 


Equit 


ies 


S&P/Toronto SX Composite Index 


XPT 


Metals 


Platinum 


SPX 


Equit 


ies 


Standard and Poor's 500 


CL1 


Fuels 


Crude oil, WTI 


UKX 


Equit 


ies 


FTSE 100 Index 


COl 


Fuels 


Crude oil, brent 


GDDUEMEA 


Equit 


ies 


EM: Europe, Middle East, Africa 


HOI 


Fuels 


Heating oil 


GDUEEGFA 


Equit 


ies 


EM: Asia 


NG1 


Fuels 


Natural gas 


GDUEEGFL 


Equit 


ies 


EM: Latin America 


BOl 


Commodities 


Soybean oil 


ATGATR 


Gov. 


bonds 


Austria 


C 1 


Commodities 


Corn 


AUGATR 


Gov. 


bonds 


Australia 


CC1 


Commodities 


Cocoa 


BEGATR 


Gov. 


bonds 


Belgium 


CT1 


Commodities 


Cotton 


CAGATR 


Gov. 


bonds 


Canada 


FC1 


Commodities 


Coffee 


DEGATR 


Gov. 


bonds 


Denmark 


JN1 


Commodities 


Feeder cattle 


FIGATR 


Gov. 


bonds 


Finland 


JOl 


Commodities 


Orange juice 


FRGATR 


Gov. 


bonds 


France 


KC1 


Commodities 


Coffee 


GRGATR 


Gov. 


bonds 


Germany 


LB1 


Commodities 


Lumber 


IEGATR 


Gov. 


bonds 


Ireland 


LCI 


Commodities 


Live cattle 


ITGATR 


Gov. 


bonds 


Italy 


LH1 


Commodities 


Lean hogs 


JNGATR 


Gov. 


bonds 


Japan 


O 1 


Commodities 


Oats 


NEGATR 


Gov. 


bonds 


Netherlands 


PB1 


Commodities 


Frozen pork bellies 


NOGATR 


Gov. 


bonds 


Norway 


QW1 


Commodities 


Sugar 


NZGATR 


Gov. 


bonds 


New Zealand 


RR1 


Commodities 


Rough rice 


PTGATR 


Gov. 


bonds 


Portugal 


S 1 


Commodities 


Soybean 


SPGATR 


Gov. 


bonds 


Spain 


SMI 


Commodities 


Soybean meal 


BWbAlR 


Gov. 


bonds 


Sweden 


W 1 


Commodities 


Tin j 

Wheat 


SZGATR 


Gov. 


bonds 


Switzerland 


MOODCAAA 


Corp. bonds 


Moody's AAA rated 


UKGATR 


Gov. 


bonds 


U.K. 


MOODCAA 


Corp. bonds 


Moody's AA rated 


USGATR 


Gov. 


bonds 


U.S. 


MOODCA 


Corp. bonds 


Moody's A rated 


AUDUSD 


Currencies 


Australian dollar 


MOODCBAA 


Corp. bonds 


Moody's BAA rated 



Appendix B: Example Correlations Betweens Assets 
and Components 

In Fig. [9J we show example plots of the absolute corre- 
lation |r(zj,yfc)| as a function of time for specific assets. 
These figures correspond to horizontal slices through the 
plots shown in Fig. [7] Figure [5] highlights that there 
are many time steps at which the absolute correlation 
between the PCs and asset return time series are sig- 
nificantly larger than the values for random matrices. 
For example, the correlation between Danish government 
bonds (DEGATR) and the first PC exceeds 0.9 at some 
time steps and is above the 99 th percentile for random 
matrices at every time step until the collapse of Lehman 
Brothers. After Lehman's collapse, the correlation of DE- 
GATR with the first PC falls sharply, but its correlation 
with the second PC increases to approximately 0.8. We 
observe similar behavior for AA-rated corporate bonds 
(MOODCAA), but there is also a period in 2003 dur- 



ing which corporate bonds were significantly correlated 
with the second PC. We can make similar observations 
for assets from other classes. For example, the absolute 
correlations of the AUDUSD exchange rate and oil fu- 
tures (COl) with the first PC are both significant after 
the collapse of Lehman Brothers; the correlation of gold 
(XAU) with the third PC is significant over the same pe- 
riod; and the correlation of the price of soybean futures 
(S 1) is initially significantly correlated with the fourth 
PC and then becomes significantly correlated with the 
first PC. Figure IH1 demonstrates that by considering the 
changes in the magnitudes of the correlations between 
the PCs, we can gain insights into the changes taking 
place in markets. 



Appendix C: Contribution of Assets to the 
Correlations 

The PCs are defined as linear combinations of the asset 
return time series %i [see Eq. (|3J)]. Because the assets con- 
tribute to the PCs, some of the correlation [r(z,,yfc)| be- 
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FIG. 9: Examples of the absolute correlation |r(zt,yfc)| between various assets and the first five principal components (k = 
1,...,5) as a function of time. We show Danish government bonds (DEGATR), AA-rated corporate bonds (MOODCAA), 
Nikkei 225 (NKY), gold (XAU), oil (COl), and soybean futures (S 1). The first row shows correlations for the first PC, the 
second row for the second PC, and so on. The horizontal red lines show the values of the 99 th percentile of the distribution of 
absolute correlation coefficients for the corresponding PC for random data. As we increase k from 1 to 5, the 99 th percentile 
of |r(zi,yfe)| for random matrices decreases from 0.47 to 0.43. 



tween the i asset and the k PC is attributable to asset 
i. To understand the effect of these self-correlations, we 
calculate the correlation between the return time series 
Zj and each of the PCs with the contribution from the i th 
asset removed. For each asset i, we define k = 1, . . . , N 
adjusted PCs w^. with elements given by 

N 

w k (t) = J2^uZi(t) (CI) 

and calculate the absolute correlation |r(zj,Wfc)| between 
each asset and the adjusted PCs. 

In Fig. [TU1 we compare the distribution of the differ- 



ences between the absolute correlations \r(z i: yk)\ (be- 
tween the assets and the PCs) and the correlations 
|r(zj,Wfc)| (between the assets and the adjusted PCs). 
The correlations are very similar for k = 1, which im- 
plies that the self-correlations do not dominate the cor- 
relations between PCs and assets. However, for larger 
k, the differences between |j"(zj,yfc)| and |r(zi, Wfc)| be- 
come more significant. This is expected because fewer 
assets make significant contributions to the higher PCs, 
so the removal of a single asset can have a larger effect. 
This highlights the fact that many assets contribute to 
the lower PCs, whereas the higher PCs are localized and 
have only a few assets contributing to them. 
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play "volatility clustering", which means that large- 
magnitude returns tend to be followed by large- 
magnitude returns and small-magnitude returns tend to 
be followed by small-magnitude returns [47ll4Sl|. The ran- 
dom return time series that we investigate do not pos- 
sess this property; moreover, such memory of volatility 
in individual time series would not affect the results. We 
consider zero-lag correlations and the zero-lag autocor- 



relation is necessarily equal to 1 for any time series [3^ ]. 
Given this, the results for an investigation of correlations 
at zero lag for mutually uncorrelated time series with 
memory must be the same as the results for mutually 
uncorrelated time series without memory. 
[52] The peg between the Danish krone and the Euro means 
that the value of the krone is matched to the value of the 
Euro. 



